BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source:



Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§ 1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Raw Sequence Listing Error Summary






ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1   Wrapped Nucleics
    Wrapped Aminos
        The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
        was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
        prevent "wrapping."

2   Invalid Line Length
        The rules require that a line not exceed 72 characters in length. This includes white spaces.

3   Misaligned Amino Numbering
        The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
        use space characters instead.

4   Non-ASCII
        The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
        ensure your subsequent submission is saved in ASCII text.

5   Variable Length
        Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
        each n or Xaa can only represent a single residue. Please present the maximum number of each
        residue having variable length and indicate in the <220>-<223> section that some may be missing.

6   PatentIn 2.0 "bug"
        A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
        sequence(s). Normally, PatentIn would automatically generate this section from the
        previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
        the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
        Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
        Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

        (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
        This sequence is intentionally skipped

        Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
        Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

        <210> sequence id number
        <400> sequence id number
        000

9   Use of n's or Xaa's (NEW RULES)
        Use of n's and/or Xaa's has been detected in the Sequence Listing.
        Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
        In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
        Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
        scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
        is Artificial Sequence.

11  Use of <220>
        Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.
        Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
        "Unknown." Please explain source of genetic material in <220> to <223> section.
        (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
        Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
        resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
        listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13  Misuse of n
        n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
        any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001
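The three purely textual problems named in the error summary (lines longer than 72 characters, tab codes, and non-ASCII text) can be screened for before a file is submitted. The following is a minimal sketch only, not the Checker program itself: the function name `check_format` and the message strings are hypothetical, and for simplicity it flags a tab anywhere in the file, although the summary mentions tabs only in connection with amino acid numbering.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; white space counts


def check_format(raw: bytes) -> list[tuple[int, str]]:
    """Return (line_number, problem) pairs for the raw bytes of a listing file."""
    problems = []
    for number, line in enumerate(raw.splitlines(), start=1):
        # Any byte above 127 means the file is not plain ASCII text.
        if any(byte > 127 for byte in line):
            problems.append((number, "non-ASCII character"))
        # Tab codes are reported as the cause of misaligned numbering.
        if b"\t" in line:
            problems.append((number, "tab character"))
        # The line terminator is already stripped, so this counts visible
        # characters and spaces only.
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line longer than 72 characters"))
    return problems
```

Reading the file in binary mode (`open(path, "rb").read()`) keeps the check independent of any text encoding assumed by the operating system.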



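The new-rules items in the error summary concerning n's or Xaa's, the <213> response, and the <220>-<223> section amount to a per-sequence check. The sketch below illustrates one way to express it under stated assumptions: the input is the applicant's own file (numeric identifiers without the English "alpha" headers added by PTO software), each sequence starts at its <210> line, and the helper names `split_records` and `check_features` are hypothetical. It does not validate scientific names and is not a substitute for Checker.

```python
import re

NEEDS_FEATURE = {"unknown", "artificial sequence"}


def split_records(listing: str) -> list[dict[str, list[str]]]:
    """Group lines by numeric identifier, one dict per <210> sequence."""
    records: list[dict[str, list[str]]] = []
    current = None
    field = None
    for line in listing.splitlines():
        match = re.match(r"\s*<(\d{3})>\s*(.*)", line)
        if match:
            field = match.group(1)
            if field == "210":  # a new sequence record begins here
                current = {}
                records.append(current)
            if current is not None:
                current.setdefault(field, []).append(match.group(2).strip())
        elif current is not None and line.strip():
            # Continuation line: belongs to the most recent identifier.
            current[field].append(line.strip())
    return records


def check_features(listing: str) -> list[str]:
    """Report sequences that need a <220>-<223> section but lack one."""
    problems = []
    for record in split_records(listing):
        seq_id = record["210"][0]
        body = " ".join(record.get("400", [])[1:])
        if body.strip() == "000":  # intentionally skipped sequence
            continue
        has_feature = "220" in record and "223" in record
        if record.get("212", [""])[0].upper() == "PRT":
            uses_unknown = re.search(r"\bXaa\b", body) is not None
        else:
            uses_unknown = "n" in re.sub(r"[^A-Za-z]", "", body).lower()
        organism = " ".join(record.get("213", [])).lower()
        if uses_unknown and not has_feature:
            problems.append(
                f"SEQ ID NO {seq_id}: n or Xaa present but <220>-<223> section missing"
            )
        if organism in NEEDS_FEATURE and not has_feature:
            problems.append(
                f"SEQ ID NO {seq_id}: <213> is Unknown or Artificial Sequence "
                "but <220>-<223> section missing"
            )
    return problems
```

Protein records are tested for the token `Xaa` rather than the letter n, because ordinary residues such as Asn contain that letter.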
RAW SEQUENCE LISTING DATE: 09/19/2001

PATENT APPLICATION: US/09/786,675A TIME: 10:20:21

Input Set : A:\BB-1240 Seq List - Corrections.txt 
Output Set: N:\CRF3\09192001\I786675A.raw 



3 <110> APPLICANT: Cahoon, Rebecca E 

4 Miao, Gou-Hau 

5 Powell, Wayne 

7 <120> TITLE OF INVENTION: Plant Farnesyltransferases

9 <130> FILE REFERENCE: BB-1240                              Does Not Comply

11 <140> CURRENT APPLICATION NUMBER: US/09/786,675A          Corrected Diskette Needed

C--> 12 <141> CURRENT FILING DATE: 2001-08-29 

14 <150> PRIOR APPLICATION NUMBER: 60/099,521 

15 <151> PRIOR FILING DATE : 1998-09-08 
17 <160> NUMBER OF SEQ ID NOS : 23 

19 <170> SOFTWARE: Microsoft Word Version 7.0A

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 1426 
23 <212> TYPE: DNA

24 <213> ORGANISM: Zea mays 

26 <400> SEQUENCE: 1 

27 gcacgagaca gcgcaattac ttaagctatt tgtattcgga tctgatccaa ccctggtggt 60 

28 cagctggact catcgcccat ggagcacact aagtcaggcc ccagcagttg gccagaactg 120 

29 gccgacgtgg tgccggtgcc gcaggacgat gggcctagcc ctgtggtgtc catcgcctat 180 

30 cgagatgact ttcgtgaggt catggattac ttccgcgccc tctacctcac cggtgagcga 240 

31 agccctcgcg ctctccgcct caccgccgag gccatcgagc tcaaccccgg caactacact 300 

32 gtctggcatt tccggcgcct tattctggag tcactagatt ttgatttact agaggagatg 360 

33 aaatttgtcg aaaaaattgc tgaatgcaat ccaaaaaatt accaaatctg gcaccataag 420 

34 agatggcttg ctgagaaatt aggacctggt attgcaaaca aagagcatga attcacaatg 480 

35 aagatacttg ctattgatgc aaaaaattat catgcttggt ctcataggca gtgggttctt 54 0 

36 caagcgttgg ggggatggga gactgaatta gaatactgtg accacttact taaggaagac 600 

37 gtcttcaata attcagcttg gaatcagaga tactttgtta taacaagatc accatttctt 660 

38 ggtggccttg cggcaatgcg tgattcagaa gtagactaca caattgaagc tattctagca 720 

39 aacgctcaga atgaaagccc ctggaggtac ctcaagggtc tatacaaggg tgagaataac 780 

40 ctgctagtag aggacgagcg catctctgct gtttgtttca aggtcctgaa gaatgattgg 840

41 acttgtgtat ttgctttgag tttgctgctc gatcttctct gcactggttt gcagccttca 900 

42 gatgaactta ggtccactct tgaaacaata aggagctccc atcctgaaac cgcggatgat 960 
43 gatcctgcag ccgctgtttg ctgtatcctg cagaaatgtg atcccctgcg ggtaaattat 1020
44 tggtcttggt tcaaggacac tctttctcag atctcatgac ttcacatggg ttcacccctt 1080
45 gtccgcgctg gtccgggctc tgtgagatag acatgtttta gatagtttca ttggacaccc 1140

46 aaacagagcg gacagagtgt atggctgcta ccttctccgt gactgaaagc agtgcttgta 1200 

47 acgattttgt ttagtaaaat ttgtgagtgt tactgctcca aacaacacct tatgcaacca 1260 

48 tatttgaata tttcacatgt aagcttgaat ccaggtgtgt ttgttaatgt attacaattg 1320 

49 ccatgggagc ctaaatgaga cccataatca cttccactag agtcggaaga ccgtgtcgag 1380 

50 cagttcactc atatggtcac ttaaagcaaa aaaaaaaaaa aaaaaa 1426 

52 <210> SEQ ID NO: 2 

53 <211> LENGTH: 326 

54 <212> TYPE: PRT 

55 <213> ORGANISM: Zea mays 

57 <400> SEQUENCE: 2 

58 Met Glu His Thr Lys Ser Gly Pro Ser Ser Trp Pro Glu Leu Ala Asp 

59 1 5 10 15 
98 210 215 220

100 Leu Lys Gly Leu Tyr Lys Gly Glu Asn Asn Leu Leu Val Glu Asp Glu 

101 225 230 235 240 

103 Arg Ile Ser Ala Val Cys Phe Lys Val Leu Lys Asn Asp Trp Thr Cys

104 245 250 255

106 Val Phe Ala Leu Ser Leu Leu Leu Asp Leu Leu Cys Thr Gly Leu Gln

107 260 265 270

109 Pro Ser Asp Glu Leu Arg Ser Thr Leu Glu Thr Ile Arg Ser Ser His
110 275 280 285

112 Pro Glu Thr Ala Asp Asp Asp Pro Ala Ala Ala Val Cys Cys Ile Leu

113 290 295 300

115 Gln Lys Cys Asp Pro Leu Arg Val Asn Tyr Trp Ser Trp Phe Lys Asp

116 305 310 315 320

118 Thr Leu Ser Gln Ile Ser

119 325 

121 <210> SEQ ID NO: 3 

122 <211> LENGTH: 1218 

123 <212> TYPE: DNA 

124 <213> ORGANISM: Oryza sativa 

126 <400> SEQUENCE: 3 

127 gcacgaggtt ctaacgccgc cgccgccgcc gccgtctccg cagaatctga tcgatggcgc 60

128 cgtcgtcgac gtcgtcggag ggtgcctccg acgagtggtt gccacccagc cggcggccgg 120

129 agctggcgga cgtggtcccc gtgacgcagg acgacgggcc ccaccccgtg gtggccatcg 180

130 cctaccggga cgagttccgc gaggtcatgg actacttccg cgccctctac ttcgccggcg 240 
204 260 265 270

206 Leu Leu Asp Leu Leu Gln Ile Gly Leu Gln Pro Ser Asp Glu

207 275 280 285

209 Gly Thr Ile Glu Ala Ile Lys Asn Ser Asp Pro Glu Ala Asp

210 290 295 300

212 Val Asp Ala Asp Leu Ala Thr Ala Ile Cys Ser Ile Leu Gln

213 305 310 315

215 Asp Pro Leu Arg Ile Asn Tyr Trp Ser Trp Tyr Arg Thr Thr

216 325 330
218 Ser Gln Thr

221 <210> SEQ ID NO: 5 

222 <211> LENGTH: 1261 

223 <212> TYPE: DNA 

224 <213> ORGANISM: Glycine max 

226 <400> SEQUENCE: 5 

227 gcacgaggat taacgaagga tggaatctgg gtctagcgaa ggagaagagg tgcagcaacg 60 

228 cgtgccgttg agggagagag tggagtggtc agatgttact ccggttcctc aaaacgacgg 120 

229 ccctaaccct gtcgttccga tccagtacac tgaagagttt tccgaagtta tggattactt 180 

230 tcgcgccgtt tacctcaccg atgaacgctc ccctcgcgcc ctcgctctca cagccgaagc 240 

231 cgttcaattc aactccggca actacactgt gtggcatttc cgacggttgt tacttgagtc 300 

232 gctaaaagtc gacttgaacg atgaactgga ttttgtggag cgtatggccg ctggaaattc 360 

233 taaaaattat cagatgtggc atcatagacg atgggttgcc gagaagttag gtcctgaagc 420 

234 tagaaacaat gagctcgagt tcaccaaaaa gatactgtcc gttgatgcca aacattatca 480 

235 tgcatggtct catagacagt gggctcttca aacactagga ggatgggaag atgaacttaa 540 

236 ttattgcaca gaactactta aagaagacat ttttaacaat tctgcttgga atcagagata 600 

237 ttttgtcata acaaggtctc ctttcttggg gggcctaaaa gctatgagag agtctgaagt 660 

238 gctttacacc attgaagcca ttatagccta ccctgaaaat gaaagctcgt ggagatatct 720 

239 acgaggactt tataaaggtg aaactacttc atgggtaaat gatcctcaag tttcttcagt 780 

240 atgcttaaag attttgagaa ctaagagcaa ctacgtgttt gctcttagca ctattttaga 840 

241 tcttatatgc tttggttatc aaccaaatga agacattaga gatgccattg acgccttaaa 900 

242 gaccgcagat atggataaac aagatttaga tgatgatgag aaaggggaac aacaaaattt 960 
243 aaatatagca cgaaatattt gttctatcct aaaacaagtt gatccaatta gaaccaacta 1020
244 ttggatttgg cgcaagagca gacttcctct atcagcttag taaccaaagt aattaaaggg 1080
245 caactctgtg ttatgtgtaa cctagtttat tgaaactgga tttttattta ttattatttt 1140
246 ttatgttgtc atgtatctgt ttgtgcaaat ttatcttttt gtcatgccat tactggcatt 1200
247 tgagtgtaag gattgaaagc catgcagaat aagaaattta agtttttttt tccgttgaaa 1260
248 a 1261 

250 <210> SEQ ID NO: 6 

251 <211> LENGTH: 346 

252 <212> TYPE: PRT
253 <213> ORGANISM: Glycine max
255 <400> SEQUENCE: 6



256 Met Glu Ser Gly Ser Ser Glu Gly Glu Glu Val Gln Gln Arg Val Pro



The type of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 
320 340 345

322 <210> SEQ ID NO: 7 

323 <211> LENGTH: 1333 

324 <212> TYPE: DNA 

325 <213> ORGANISM: Glycine max 

327 <400> SEQUENCE: 7 

328 gcacgagctt gcgtgtggag tgaagaagat taacgaagga tggaatctgg gtctagcgaa 60 